Serveur d'exploration sur l'OCR

Attention, ce site est en cours de développement !
Attention, site généré par des moyens informatiques à partir de corpus bruts.
Les informations ne sont donc pas validées.

Peripheral and global features for use in coarse classification of Chinese characters

Identifieur interne : 002378 ( Main/Exploration ); précédent : 002377; suivant : 002379

Peripheral and global features for use in coarse classification of Chinese characters

Auteurs : Kuo-Sen Chou [République populaire de Chine] ; Kuo-Chin Fan [République populaire de Chine] ; Tzu-I Fan [République populaire de Chine]

Source :

RBID : ISTEX:9EBC45851BF53BF96B2314C37DDCD3A03AD5B9AA

Abstract

In this paper, a simple and effective approach to the coarse classification of handwritten Chinese characters is proposed. In our approach, a Chinese character is characterized by string representation using periphery and global feature vectors. The peripheral features include four strings to represent the structure of segments in top, bottom, left, and right directions. The global features include the number of horizontal segments in the top direction and bottom direction, and the number of stroke segments in a character. In addition, a scoring-based coarse classification scheme is devised in choosing the proper candidate characters. Twenty sets of Chinese characters (5401 characters/set) are tested. The number of candidate characters is reduced from 5401 to about 80 with the error rate less than 1.2% in average. Experimental results reveal the feasibility of the proposed approach in classifying Chinese characters.

Url:
DOI: 10.1016/S0031-3203(96)00090-8


Affiliations:


Links toward previous steps (curation, corpus...)


Le document en format XML

<record>
<TEI wicri:istexFullTextTei="biblStruct">
<teiHeader>
<fileDesc>
<titleStmt>
<title>Peripheral and global features for use in coarse classification of Chinese characters</title>
<author>
<name sortKey="Chou, Kuo Sen" sort="Chou, Kuo Sen" uniqKey="Chou K" first="Kuo-Sen" last="Chou">Kuo-Sen Chou</name>
</author>
<author>
<name sortKey="Fan, Kuo Chin" sort="Fan, Kuo Chin" uniqKey="Fan K" first="Kuo-Chin" last="Fan">Kuo-Chin Fan</name>
</author>
<author>
<name sortKey="Fan, Tzu I" sort="Fan, Tzu I" uniqKey="Fan T" first="Tzu-I" last="Fan">Tzu-I Fan</name>
</author>
</titleStmt>
<publicationStmt>
<idno type="wicri:source">ISTEX</idno>
<idno type="RBID">ISTEX:9EBC45851BF53BF96B2314C37DDCD3A03AD5B9AA</idno>
<date when="1997" year="1997">1997</date>
<idno type="doi">10.1016/S0031-3203(96)00090-8</idno>
<idno type="url">https://api.istex.fr/document/9EBC45851BF53BF96B2314C37DDCD3A03AD5B9AA/fulltext/pdf</idno>
<idno type="wicri:Area/Istex/Corpus">001729</idno>
<idno type="wicri:Area/Istex/Curation">001634</idno>
<idno type="wicri:Area/Istex/Checkpoint">001828</idno>
<idno type="wicri:doubleKey">0031-3203:1997:Chou K:peripheral:and:global</idno>
<idno type="wicri:Area/Main/Merge">002508</idno>
<idno type="wicri:Area/Main/Curation">002378</idno>
<idno type="wicri:Area/Main/Exploration">002378</idno>
</publicationStmt>
<sourceDesc>
<biblStruct>
<analytic>
<title level="a">Peripheral and global features for use in coarse classification of Chinese characters</title>
<author>
<name sortKey="Chou, Kuo Sen" sort="Chou, Kuo Sen" uniqKey="Chou K" first="Kuo-Sen" last="Chou">Kuo-Sen Chou</name>
<affiliation wicri:level="1">
<country xml:lang="fr">République populaire de Chine</country>
<wicri:regionArea>Institute of Computer Science and Information Engineering, National Central University, Taiwan</wicri:regionArea>
<wicri:noRegion>Taiwan</wicri:noRegion>
</affiliation>
</author>
<author>
<name sortKey="Fan, Kuo Chin" sort="Fan, Kuo Chin" uniqKey="Fan K" first="Kuo-Chin" last="Fan">Kuo-Chin Fan</name>
<affiliation wicri:level="1">
<country xml:lang="fr">République populaire de Chine</country>
<wicri:regionArea>Institute of Computer Science and Information Engineering, National Central University, Taiwan</wicri:regionArea>
<wicri:noRegion>Taiwan</wicri:noRegion>
</affiliation>
</author>
<author>
<name sortKey="Fan, Tzu I" sort="Fan, Tzu I" uniqKey="Fan T" first="Tzu-I" last="Fan">Tzu-I Fan</name>
<affiliation wicri:level="1">
<country xml:lang="fr">République populaire de Chine</country>
<wicri:regionArea>Institute of Computer Science and Information Engineering, National Central University, Taiwan</wicri:regionArea>
<wicri:noRegion>Taiwan</wicri:noRegion>
</affiliation>
</author>
</analytic>
<monogr></monogr>
<series>
<title level="j">Pattern Recognition</title>
<title level="j" type="abbrev">PR</title>
<idno type="ISSN">0031-3203</idno>
<imprint>
<publisher>ELSEVIER</publisher>
<date type="published" when="1996">1996</date>
<biblScope unit="volume">30</biblScope>
<biblScope unit="issue">3</biblScope>
<biblScope unit="page" from="483">483</biblScope>
<biblScope unit="page" to="489">489</biblScope>
</imprint>
<idno type="ISSN">0031-3203</idno>
</series>
<idno type="istex">9EBC45851BF53BF96B2314C37DDCD3A03AD5B9AA</idno>
<idno type="DOI">10.1016/S0031-3203(96)00090-8</idno>
<idno type="PII">S0031-3203(96)00090-8</idno>
</biblStruct>
</sourceDesc>
<seriesStmt>
<idno type="ISSN">0031-3203</idno>
</seriesStmt>
</fileDesc>
<profileDesc>
<textClass></textClass>
<langUsage>
<language ident="en">en</language>
</langUsage>
</profileDesc>
</teiHeader>
<front>
<div type="abstract" xml:lang="en">In this paper, a simple and effective approach to the coarse classification of handwritten Chinese characters is proposed. In our approach, a Chinese character is characterized by string representation using periphery and global feature vectors. The peripheral features include four strings to represent the structure of segments in top, bottom, left, and right directions. The global features include the number of horizontal segments in the top direction and bottom direction, and the number of stroke segments in a character. In addition, a scoring-based coarse classification scheme is devised in choosing the proper candidate characters. Twenty sets of Chinese characters (5401 characters/set) are tested. The number of candidate characters is reduced from 5401 to about 80 with the error rate less than 1.2% in average. Experimental results reveal the feasibility of the proposed approach in classifying Chinese characters.</div>
</front>
</TEI>
<affiliations>
<list>
<country>
<li>République populaire de Chine</li>
</country>
</list>
<tree>
<country name="République populaire de Chine">
<noRegion>
<name sortKey="Chou, Kuo Sen" sort="Chou, Kuo Sen" uniqKey="Chou K" first="Kuo-Sen" last="Chou">Kuo-Sen Chou</name>
</noRegion>
<name sortKey="Fan, Kuo Chin" sort="Fan, Kuo Chin" uniqKey="Fan K" first="Kuo-Chin" last="Fan">Kuo-Chin Fan</name>
<name sortKey="Fan, Tzu I" sort="Fan, Tzu I" uniqKey="Fan T" first="Tzu-I" last="Fan">Tzu-I Fan</name>
</country>
</tree>
</affiliations>
</record>

Pour manipuler ce document sous Unix (Dilib)

EXPLOR_STEP=$WICRI_ROOT/Ticri/CIDE/explor/OcrV1/Data/Main/Exploration
HfdSelect -h $EXPLOR_STEP/biblio.hfd -nk 002378 | SxmlIndent | more

Ou

HfdSelect -h $EXPLOR_AREA/Data/Main/Exploration/biblio.hfd -nk 002378 | SxmlIndent | more

Pour mettre un lien sur cette page dans le réseau Wicri

{{Explor lien
   |wiki=    Ticri/CIDE
   |area=    OcrV1
   |flux=    Main
   |étape=   Exploration
   |type=    RBID
   |clé=     ISTEX:9EBC45851BF53BF96B2314C37DDCD3A03AD5B9AA
   |texte=   Peripheral and global features for use in coarse classification of Chinese characters
}}

Wicri

This area was generated with Dilib version V0.6.32.
Data generation: Sat Nov 11 16:53:45 2017. Site generation: Mon Mar 11 23:15:16 2024